After a teddy bear talked about kink, AI watchdogs are warning parents against smart toys
'Children could become attached to a bot rather than a person or imaginary friend, which could hurt their development.'

Advocates are fighting against the $16.7bn global smart-toy market, decrying surveillance and a lack of regulation.

As the holiday season looms into view with Black Friday, one category on people's gift lists is causing increasing concern: products with artificial intelligence. The development has raised new concerns about the dangers smart toys could pose to children, as consumer advocacy groups say AI could harm kids' safety and development. The trend has prompted calls for increased testing of such products and for governmental oversight.
Supplementary Material for LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
Anonymous Author(s)

A Implementation Details
Table 1: The prepending instructions provided to GPT-3.5/4 during LayoutGPT's 2D and 3D tasks.

Task Instruction for GPT-3.5/4

2D Layout Planning Instruction: Given a sentence prompt that will be used to generate an image, plan the layout of the image. Formally, each line should be like "object {width:?px; height:?px; left:?px; top:?px; }". Formally, each line should follow the template: FURNITURE {length:?px:
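The CSS-style layout template above is simple enough to round-trip programmatically. As a minimal sketch (not code from the paper; the regexes and function name are illustrative assumptions), the following parser extracts an object name and its pixel attributes from one such line:

```python
import re

# Illustrative parser for the CSS-like layout lines LayoutGPT is asked to
# emit, e.g. 'chair {width:120px; height:80px; left:30px; top:200px; }'.
LINE_RE = re.compile(r"(\w+)\s*\{([^}]*)\}")
ATTR_RE = re.compile(r"(\w+)\s*:\s*(\d+)px")

def parse_layout_line(line):
    """Return (object_name, {attribute: pixel_value}) for one layout line."""
    match = LINE_RE.search(line)
    if match is None:
        raise ValueError(f"not a layout line: {line!r}")
    name, body = match.group(1), match.group(2)
    return name, {k: int(v) for k, v in ATTR_RE.findall(body)}

name, box = parse_layout_line("chair {width:120px; height:80px; left:30px; top:200px; }")
```

Parsing the model's text output back into numeric boxes like this is what lets a downstream image or scene generator consume the planned layout.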
V100 GPUs were used to train the models. Consortium and are licensed under a Creative Commons Attribution 4.0 License. Similarly, for evaluating the agent listener with a human speaker, each agent evaluates 400 human utterances (Fig. 5b). In Fig. 10, we present the results of the human evaluation on the text game. In Sec. 4.3, we show that agents trained using our method beat all prior baselines when paired with both. The blue bars show the standard deviation across all agents present in the buffer.
When Cars Have Stereotypes: Auditing Demographic Bias in Objects from Text-to-Image Models
Choi, Dasol, Lee, Jihwan, Lee, Minjae, Kahng, Minsuk
While prior research on text-to-image generation has predominantly focused on biases in human depictions, we investigate a more subtle yet pervasive phenomenon: demographic bias in generated objects (e.g., cars). We introduce SODA (Stereotyped Object Diagnostic Audit), a novel framework for systematically measuring such biases. Our approach compares visual attributes of objects generated with demographic cues (e.g., "for young people") to those from neutral prompts, across 2,700 images produced by three state-of-the-art models (GPT Image-1, Imagen 4, and Stable Diffusion) in five object categories. Through a comprehensive analysis, we uncover strong associations between specific demographic groups and visual attributes, such as recurring color patterns prompted by gender or ethnicity cues. These patterns reflect and reinforce not only well-known stereotypes but also more subtle and unintuitive biases. We also observe that some models generate less diverse outputs, which in turn amplifies the visual disparities compared to neutral prompts. Our proposed auditing framework offers a practical approach for testing, revealing how stereotypes remain embedded in today's generative models. We see this as an essential step toward more systematic and responsible AI development.
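The audit SODA describes reduces, at its core, to comparing a visual attribute under a demographic cue against the same attribute under a neutral prompt. A minimal sketch of that comparison, using mean RGB color as the attribute and random arrays standing in for generated images (none of this is the paper's actual code or metric):

```python
import numpy as np

# Toy stand-ins for generated images: arrays of shape (N, H, W, 3).
# In a real audit these would be model outputs for the two prompt variants.
rng = np.random.default_rng(0)
neutral_imgs = rng.uniform(0, 255, size=(10, 64, 64, 3))  # neutral prompt
cued_imgs = neutral_imgs * 0.5                            # demographic-cued prompt

def mean_color(images):
    """Average RGB color over every pixel of every image in the set."""
    return images.reshape(-1, 3).mean(axis=0)

# One possible disparity score: distance between the two sets' mean colors.
disparity = float(np.linalg.norm(mean_color(cued_imgs) - mean_color(neutral_imgs)))
```

A large disparity for a given cue, relative to the neutral baseline, is the kind of signal such an audit would flag for closer inspection.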
SPARK: Graph-Based Online Semantic Integration System for Robot Task Planning
Shirasaka, Mimo, Ikeda, Yuya, Matsushima, Tatsuya, Matsuo, Yutaka, Iwasawa, Yusuke
The ability to update information acquired through various means online during task execution is crucial for a general-purpose service robot. This information includes geometric and semantic data. While SLAM handles geometric updates on 2D maps or 3D point clouds, online updating of semantic information remains unexplored. We attribute the challenge to finding an online scene graph representation that is both useful and scalable. Building on previous work on offline scene graph representations, we study online graph representations of semantic information. We introduce SPARK: Spatial Perception and Robot Knowledge Integration. This framework extracts semantic information from environment-embedded cues and updates the scene graph accordingly, which is then used for subsequent task planning. We demonstrate that graph representations of spatial relationships enhance the robot system's ability to perform tasks in dynamic environments and to adapt to unconventional spatial cues, such as gestures.
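An online scene graph of the kind SPARK maintains can be sketched as a mapping from object pairs to spatial relations that is revised as new cues arrive; the class and method names below are hypothetical, not SPARK's API:

```python
# Hypothetical online scene graph: nodes are objects, edges carry spatial
# relations, and update() revises the graph as cues arrive during execution.
class SceneGraph:
    def __init__(self):
        self.edges = {}  # (subject, object) -> relation

    def update(self, subject, relation, obj):
        """Insert or overwrite one observed spatial relation."""
        self.edges[(subject, obj)] = relation

    def query(self, subject, obj):
        """Return the current relation between two objects, if any."""
        return self.edges.get((subject, obj))

graph = SceneGraph()
graph.update("cup", "on", "table")       # initial observation
graph.update("cup", "next_to", "table")  # later cue revises the relation online
```

The point of the online setting is exactly this last step: a later observation overwrites a stale relation, so subsequent task planning queries a current view of the scene rather than a snapshot.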
ATLASv2: LLM-Guided Adaptive Landmark Acquisition and Navigation on the Edge
Walczak, Mikolaj, Kallakuri, Uttej, Mohsenin, Tinoosh
Autonomous systems deployed on edge devices face significant challenges, including resource constraints, real-time processing demands, and adaptation to dynamic environments. This work introduces ATLASv2, a novel system that integrates a fine-tuned TinyLLM, real-time object detection, and efficient path planning to enable hierarchical, multi-task navigation and manipulation entirely on an edge device, the Jetson Nano. ATLASv2 dynamically expands its set of navigable landmarks by detecting and localizing objects in the environment, which are saved to its internal knowledge base for use in future task execution. We evaluate ATLASv2 in real-world environments, including handcrafted home and office settings constructed with diverse objects and landmarks. Results show that ATLASv2 effectively interprets natural language instructions, decomposes them into low-level actions, and executes tasks with high success rates. By leveraging generative AI in a fully on-board framework, ATLASv2 achieves optimized resource utilization with minimal prompting latency and power consumption, bridging the gap between simulated environments and real-world applications.
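ATLASv2's landmark-acquisition loop can be illustrated with a toy knowledge base in which detections become navigable landmarks that later instructions are grounded against; all names and positions below are hypothetical, not the system's actual interface:

```python
# Toy landmark knowledge base: detected objects are recorded so later
# natural-language instructions can be resolved to known locations.
class LandmarkKB:
    def __init__(self):
        self.landmarks = {}  # label -> (x, y) position

    def add_detection(self, label, position):
        """Record a newly detected object as a navigable landmark."""
        self.landmarks[label] = position

    def resolve(self, label):
        """Look up a landmark mentioned in an instruction, if known."""
        return self.landmarks.get(label)

kb = LandmarkKB()
kb.add_detection("coffee mug", (1.2, 0.4))  # detected during an earlier task
target = kb.resolve("coffee mug")           # grounded for a later instruction
```

An unresolvable label would fall back to exploration or detection, which is how such a system grows its landmark set over time.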
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
Wang, Weiyun, Chen, Zhe, Wang, Wenhai, Cao, Yue, Liu, Yangzhou, Gao, Zhangwei, Zhu, Jinguo, Zhu, Xizhou, Lu, Lewei, Qiao, Yu, Dai, Jifeng
Existing open-source multimodal large language models (MLLMs) generally follow a training process involving pre-training and supervised fine-tuning. However, these models suffer from distribution shifts, which limit their multimodal reasoning, particularly in Chain-of-Thought (CoT) performance. To address this, we introduce a preference optimization (PO) process to enhance the multimodal reasoning capabilities of MLLMs. Specifically, (1) on the data side, we design an automated preference data construction pipeline to create MMPR, a high-quality, large-scale multimodal reasoning preference dataset; and (2) on the model side, we explore integrating PO with MLLMs, developing a simple yet effective method, termed Mixed Preference Optimization (MPO), which boosts multimodal CoT performance. Our approach demonstrates improved performance across multiple benchmarks, particularly in multimodal reasoning tasks. Notably, our model, InternVL2-8B-MPO, achieves an accuracy of 67.0 on MathVista, outperforming InternVL2-8B by 8.7 points and achieving performance comparable to the 10x larger InternVL2-76B. We hope this study inspires further advancements in MLLMs. Code, data, and models will be publicly released.
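MPO builds on preference optimization; its full objective mixes several terms, but the DPO-style preference component that such methods start from can be sketched as follows (the log-probabilities are placeholders, and this is an illustration of the general technique rather than the paper's implementation):

```python
import math

def dpo_loss(pi_chosen, pi_rejected, ref_chosen, ref_rejected, beta=0.1):
    """-log(sigmoid(beta * ((pi_c - pi_r) - (ref_c - ref_r)))).

    Rewards the policy for widening its chosen-vs-rejected log-likelihood
    margin beyond the frozen reference model's margin.
    """
    margin = beta * ((pi_chosen - pi_rejected) - (ref_chosen - ref_rejected))
    return -math.log(1.0 / (1.0 + math.exp(-margin)))

# Policy prefers the chosen response more than the reference does -> lower loss.
low = dpo_loss(pi_chosen=-1.0, pi_rejected=-5.0, ref_chosen=-2.0, ref_rejected=-3.0)
high = dpo_loss(pi_chosen=-5.0, pi_rejected=-1.0, ref_chosen=-2.0, ref_rejected=-3.0)
```

The preference pairs consumed by such a loss are exactly what a dataset like MMPR supplies: a chosen and a rejected response for each multimodal prompt.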